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Abstract 

The aim of this paper is to establish rates of convergence to Gaussianity 
for wavelet coefficients on circular Poisson random fields. This result is 
established by using the Stein-Malliavin techniques introduced by Peccati 
and Zheng (2011) and the concentration properties of so-called Mexican 
needlets on the circle. 
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1 Introduction 

1.1 Motivations 

This work is concerned with the study of quantitative central limit theorems 
for linear statistics based on wavelet coefficients computed on circular Poisson 
random fields. In particular, we are referring to the very remarkable advances 
provided in this area by the combination of two probabilistic methods, the Malli¬ 
avin calculus of variations and the Stein’s method of approximations. The inter¬ 
action of these methods is successfully applied to exploit rates of convergence 
of the asymptotic normal approximation for functionals of Gaussian random 
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measures (see ESI 133), for functionals of general Poisson random measures (cfr. 
EH EH] and 13) and, more recently, to fix convergence criteria from the point 
of view of spectral theory of general Markov diffusion generators (see El EH]). 
These results have being used in a growing range of applications, see for in¬ 
stance na Enina: we mention as textbook reference EH] while the webpage 
http://www.iecn.u-nancy.fr/~nourdin/steinmalliavin.htm contains updates on 
the researches on this field. 

The main object of our investigation concerns the application of these tech¬ 
niques to the framework of wavelet coefficients computed over samples of circular 
data. These data correspond to the measures of angles labelled by a given ori¬ 
gin, i. e. a starting point on S 1 , and a given positive direction. From the 
theoretical point of view, the datasets on S 1 are characterized by a periodicity 
property, with period 2n. As consequence, a lot of interest has been recently 
raised by the development of statistical methods on the circle, also in view of 
their applications in many different sciences, as for instance geophysics, cos¬ 
mology, oceanography and engineering. A complete overview on this topic can 
be found, for instance, in the textbooks El EHl mi EH- Some more recent ap¬ 
plications can be found in |1, lOj GPU [2?| 071. In recent years (cfr. [2]), the 
literature concerning the unit g-dimensional sphere has made an extensive use 
of the construction of second-generation wavelets on the sphere, the so-called 
spherical needlets. Introduced in the literature by EH El] , spherical needlets 
are characterized by some main properties which makes them an excellent tool 
for statistical analysis, such as their concentration in both Fourier and space 
domains. Spherical needlets and some extensions were successfully applied to a 
large set of statistical problems, see for instance H El El HE]. 

Some assessments of quantitative Berry-Esseen bounds for statistics related 
to the needlet framework are already present in the literature: the multidimen¬ 
sional normal approximation of linear and nonlinear statistics based on needlet 
coefficients evaluated either on Poisson field or on vectors of i.i.d. observations 
over the sphere were studied, respectively, in J16j and in [8]. The wavelets taken 
into account here are instead the so-called Mexican needlets, built over a general 
compact manifold by D. Geller and A. Mayeli in miniEiEE]. A Mexican 
needlet ^j q - s is indexed by the shape parameter s, the resolution level j and 
by q , which indicates the region Ej q C S 1 on which the needlet is consistently 
different from 0. It can be roughly thought as a product between basis ele¬ 
ments computed on a suitable set of points Xj q £ Ej q and the Schwarz function 
w s : R i —> R+. The function w s leads to an exponential concentration property 
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in the spatial domain, stronger than the localization related to standard needlets 
(cfr. also pngHS]). Moreover, while the standard needlets are defined over a 
set of exact cubature points and weights (cfr. [33]) to have a tight frame, the 
Theorem 2.2 in [33 ] establishes that the frame obtained by the Mexican needlets 
built over a set of points under some weaker conditions (see [22j and Section [2] 
below) is nearly-tight. Various examples of statistical applications of Mexican 
needlets can be found, for instance, in 


1.2 Main results 

This work is concerned with quantitative rates of convergence to Gaussianity 
of Mexican needlet coefficients sampled over Poisson processes. Consider a se¬ 
quence of independent and identically distributed random variables {X*, i > 1}, 
taking values over S 1 so that bj q;s := E [if) • (Xi)] and (Tj q - S := E (Xi)]. 

Let us consider the independent Poisson process {N t : t > 0} on R with pa¬ 
rameter R t , which is monotonically increasing with t. Our purpose is to es¬ 
tablish conditions over the three sequences [j = j t : t > 1}, {q = q t : t > 1} and 
{R t : t > 1} so that, in the sense of the distance d 2 , the d t -dimensional vector 
Y t = (lt,ij •••; It.czJ, where 


Y tti := 


yfRt 


E 


^3t1f,s (-^i) bj tqt - y 


( 1 ) 


is asymptotically close to a Gaussian d t -dimensional standard Gaussian random 


vector Zd t . These results are stated in Theorems 3.1 and 3.2 Similar results 
were obtained on S 2 by using standard needlets, cfr. m ■ Furthermore, we study 
also the so-called ’de-Poissonized’ case, where the data are i.i.d. over S 1 and for 
which we will establish a quantitative central limit theorem, cfr. Proposition 
|3.3| We will also propose a case study concerning the nonparametric density 
estimation, which can be considered as a completion of our previous work [IB] . 

From the technical point of view, the proofs of the main theorems follow 
strictly the guidelines driven for this kind of application by m~ On the other 
hand, the ancillary results concerning the asymptotic behaviour of the covari¬ 
ance matrix of Mexican needlet coefficients sampled on Poisson random pro¬ 
cesses are also of interest. They are obtained by using the localization property 
of Mexican circular needlets developed in ESI- 
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1.3 Plan of the paper 


The Section [2] introduces some preliminary notions, such as Mexican needlets 
systems and their properties and the general results on Stein-Malliavin bounds 
from ;38l [39]. The Section [3] presents the statement of our main results on 
the rate of convergence in the Gaussian approximation of linear statistics of 
Mexican needlet coefficients by Stein-Malliavin techniques. The Section [4] is 
concerned with the proofs of the main theorems and of the auxiliary results. 
The Section [5] studies an application to the framework of the nonparametric 
density estimation of the results in the Theorem 3.2 The Section [6] contains 
some numerical evidence. 


2 Preliminary results 

2.1 The Mexican needlet framework 

In this section, we will introduce the construction of the Mexican needlets over 
the unit circle S 1 : these wavelets were introduced in the literature by D. Geller 
and A. Mayeli in |20l 21, [22j [23]. We will start with a quick overview on 
the Fourier analysis over the circle: let L 2 (S 1 ) = L 2 (S 1 ,dp) be the space of 
square integrable functions over the circle with respect to the uniform Lebesgue 
measure p (dO) := (27t) -1 dd. As well-known in the literature, the set of functions 
{uk (6) ,9 £ S 1 , k € Z}, Wfc ( x ) = exp (ikO), is an orthonormal basis over S 1 . For 
/ e L 2 (S 1 ) , we define the Fourier transform as 

1 f _ 

: = ^ J f (9) u k ( 9)d9 , 

while the Fourier expansion is given by 

f(9) =J2 akUk ^ - dGgl ■ (2) 

k^’L 

Observe that u k is the eigenfunction of the circular Laplacian A corresponding 
to eigenvalue —fc 2 , further details can be found in the textbook[44], see also [3l| . 

Consider now the function w s : M —> M+, named weight function (cfr. Figure 
[l]) and defined as 

w s ( x ) := X s exp (— x) ,iel. (3) 
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weight function - s=2 



Figure 1: the weight function w s for s = 2. 


Following [20], from the Calderon formula and for t £ R + , we define 

2 dx r (2s) 


OO r 


2 2s 


on the other hand, (see [22]) fixing the scale parameter B > 1, from the 
Daubechies’ Condition it follows that 


A B , s m B < ^2/ 2j )| <A B>S M B , 


J=-oo 


where A B , S = e s (2 log B) \ M B = (l + 0 B (\B - 11 2 log | B - 1|^ and m B = 
(l — O b (|11 2 log | A? — 11) ) . 

Fixed the resolution level j G Z, consider a partition of S 1 {Ej q : q = 1, Qj} 
such that for any qi ^ q 2l Ej qi D E ]tj2 = 0. The region Ej j. can be described 
in terms of the couple (A j q ,Xj q ): the positive constant A j q := p(Ej q ) is the 
length of Ej q , and Xj q £ Ej q is a generic point belonging to Ej q . For the sake 
of simplicity, from now on we will consider Xj q as the midpoint of the segment 
of arc Ej q . Fixing the shape and the scale parameters s £ N and B > 1 (cfr. 
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Mexican needlet-s=2; B=1.3. j=10 



Figure 2: the Mexican needlet corresponding to s = 2, j = 10 and B = 1.3. 


Figure [2]), the circular Mexican needlet ipj q - 3 : S 1 >->• C is defined as 

OO 

i>jq-,s( 9 ) : = V\jq Ws {( B ~ Jk ) 2 ) U k (Xjq)u k ( 6 ) 

k— — 00 
00 

= X! Ws exp (ik {9 - x ju )) , 6 £ S 1 . (4) 

k— — 00 

For any F £ L 2 (S 1 ), the needlet coefficient f3j q - s £ C associated to - is 
given by 

The system {'(/j jg;9 }, under some regularity conditions, describes a nearly- 
tight frame over S 1 , as proved by the Theorem 1.1 in [22] (for general manifold). 
A set of functions {gi,i > 1} over a manifold M is said a frame if there exist 
two positive constant C\ and C 2 (the tightness constants) such that for any 
F £ L 2 (M) 


ci 11*11 L 2 (M) — Er> L 2 (M) 
i 


— C 2 \\ F \\l 2 (M) ' 


The frame is tight if c = C\ = C 2 . If the frame is tight, it is characterized by a 
reconstruction formula , i.e. for anyA £ L 2 (M), the following equality holds in 
the L 2 -sense: 
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F = —f Xi ) i 2 ^ > 


which can be roughly viewed as the counterpart of the harmonic expansion 
in the wavelet framework. As example, we recall that the standard spherical 
needlets describe a tight frame over the d-dimensional sphere § d , cfr. [33, 341 ; 
a frame is nearly-tight if C2/C1 ~ 1 + e, e sufficiently close to 0. In this case, 
a reconstruction formula does not hold anymore, but it is possible to build a 
summation formula, such that 



The bias B e is smaller as e is closer to 0, cfr. |T 8 ]. Following Theorem 1.1 in 
[22] , we have that, fixing B > 1 and Cq,So > 0 sufficiently small, there exists 
a constant C 0 as follows: (i) for the pixel parameter 77 € (0,1) and for j £ Z, 
there exists a set of measurable sets {Ej q , q = 1,..., Qj}, with Xj q < qB - ^ and 
for each j with qB~^ < 8q, \j q > cq (r/F? - - 7 ) for q = 1 , ; (ii) it holds that 


OO 


{A B ,sm B - C 0 rf) ||F||^ 2 (S i) < ^ ^\/3 jq . s \ 2 < (A BtS M B + C 0 ri) \\F\\ 2 L2{§1) . 


( 6 ) 


If (A B , s m B — Coq) > 0, it follows {'fj q[s } is a nearly tight frame, since 



Remark 2.1 Mexican needlets present some remarkable advantages if com¬ 
pared to the standard needlet systems, see for instance JM El EP and the text¬ 
book m first of all, they feature a stronger concentration property in the real 
domain, cfr. JZ3 oa nsy. Then, they do not need an exact system of cuba- 
ture points and weights but they can be built over a more general partition given 
by {Ej q ,q= 1 cfr. On the other hand, they present also some 

disadvantages: spherical needlets are characterized by a compact support in the 
frequency domain (cfr. 1331 \$$), while Mexican needlets are defined over the 
whole frequency range. Furthermore, as already mentioned, standard needlets 
describe a tight frame and therefore they enjoy an exact reconstruction formula, 
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which is lacking in the Mexican needlet framework. The former issue is “em¬ 
pirically” compensated by the form of the function w s , strongly localized around 
a dominant term in the frequency domain and very close to zero out of a very 
limited set of frequencies, substantially equivalent to the compact support of the 
standard needlets. As far as the latter issue is concerned, the summation for¬ 
mula, the counterpart of the reconstruction formula, is characterized by a bias 
which is easily controlled by the user given the nearly-tightness of the frame (cfr. 

mi 

From now on, we will consider just positive resolution levels j. In order to 
respect the conditions of nearly-tightness, we impose that, for j > 0 , 

Qj » V~ 1 B j , Xjq » VB-i . (7) 


The Mexican needlets localization property can be stated as follows: for any 
s £ N, there exists c s such that 


V’jfejs ( 0 )| < c s Bl exp 


^(e- Xjk ) ^ ^ | ^(9- Xjk ) y^ 


cfr. Tp] [IF; [22] • The localization property leads to very relevant boundedness 
rules on the L p -norms: there exist c p ,C p > 0 such that 

c p B < ||^,J^ (§1) < Cp^B^- 1 ) , (8) 


cfr. [TK1HH] , 


2.2 Normal approximations and Stein-Malliavin bounds 

This section provides a quick overview on the asymptotic Gaussianity of linear 
functionals of Poisson random measures, properly adapted to the unit circle 
S 1 and initially introduced in |58J SI] • Here, we follow strictly the analogous 
findings developed on the sphere S 2 in [16 . Further general discussions and 
more technical details can be found also in [36, 40]. Let us begin this section 
by introducing some distances between laws of random variables, standard in 
the literature, which define topologies strictly stronger than the convergence in 
distribution. While the former, the Wasserstein distance, is used in univariate 
case, the latter, the (^-distance, is exploited in the multivariate case. Let g G 




C (R 9 ): its Lipschitz norm is given by 


llffll Lip = SU P 

,x^y 


Iff (x) - 9 (y)\ 

Ik-fflU 


for g £ C 2 (R 9 ), let M 2 (g) be given by 

M 2 (g) = sup ||-ffess (g (x))|| , 

y&S. q 

where ||-|| denotes the operator norm. 

Definition 2.1 Let X,Y be two random vectors with values on R 9 , q > 1, such 
that E [|| A'Hjj,] , E [|| Y]| R ,] < oo. The Wasserstein distance dw between the laws 
of X and Y is given by 


d W (X, Y) = sup |E [g (X) - g (Y )] | . 

S:||9lliip<l 

Definition 2.2 Let X,Y be two random vectors with values on R 9 , q > 1, such 
that E [||A'|| r? ] , E [||y|| R9 ] < oo. The distance d 2 between the laws of X and Y 
is given by 


d 2 (X, Y) = sup |E[ 5 (A)- ff (F)]| . 

g:ll9ll Lip <i,M 2 (g)<i 

We recall now the definition of Poisson random measures (cfr. for instance 

Definition 2.3 Let (0,A,/u) be a a-finite measure space, with no-atomic g. 
The collection of random variables {N ( A ) : A € .4} taking values on Z + U {oo} 
is a Poisson random measure (PRM) on 0 with control (intensity) measure p 
if the following conditions hold: 

1. For every element A £ A, N (A) has Poisson distribution with parameter 

2. If Ai, A 2 ,..., A n £ A are pairwise disjoint, then N (Ai),N (A n ) are 
independent. 

Remark 2.2 In our case, we choose 0 = R+ x S 1 , with A = T (0), the Borel 
subsets of 0; N corresponds to a Poisson random measure on 0, governed 
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by the intensity p = r x v. As far as r is concerned, we have that the map 
R t := r (l[o,t]) is strictly increasing and divergent as t —>■ oo and t({0}) = 0. 
The probability measure on the unit circle v is absolutely continuous with respect 
to the Lebesgue uniform measure, i. e. v (dd) = F ( 9 ) p(dd). From now on, F 
is bounded away from zero, i. e. there exist two constants M <*,, M$ > 0 such that 


0 < M 0 = inf F(0) < F (6) < sup IF (0)1 = M^ < oo , 0 € S 1 . (9a) 

ees 1 egS i 

It follows that, for any fixed t > 0, the mapping 


A i y Nt (A) —: N (l[o,t] x A) 


corresponds to a PRM over S 1 with control 

p t ( dO) = R t x v ( dO ) = R t x F (0) d8 , 

cfr. Point (i), Remark 2.f in f l@j . Furthermore, as stated in Point (ii), Remark 
2.f in w, given {Xi,i > 1}, the sequence ofi.i.d. random variables on S 1 with 
distribution v, for any t > 0, it holds the identity in distribution between the 
two mappings A > N t (A) and A (^)> where 6. (•) is the Kronecker 

delta function and N t is an independent Poisson random variable with intensity 


Rt- 


For any kernel / £ L 2 (0, p) D L 1 (0, p), with the notation N (/) and N (/) 
we will denote respectively the Wiener-Ito integrals with respect to N and the 
corresponding compensated measure N (^4) = N ( A ) — p (A), A £ F (0), with 
the convention N (A) — p (A) = oo whereas p (A) = oo. Furthermore, the 
following isometry property holds: for every f,g £ L 2 (0,/i), 



More details can be found in |40) . 

Finally, we present to rate of convergence to Gaussianity of Wiener-Ito inte¬ 
grals with respect some compensated measure N obtained by the combination 
of the Stein’s method for probabilistic approximations and the Malliavin calcu¬ 
lus of variations, involving random variables lying in the first Wiener chaos of 
N. These results are here properly adapted to S 1 . The first bound, concern¬ 
ing normal approximation in dimension 1 and the Wasserstein distance, was 
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introduced in [35], while the second one, for the d-dimensional case, d > 1, was 
introduced in ]35] . 


Proposition 2.1 Let h £ L 2 (S 1 ,/^) fl L 3 (S 1 , /x t ), let Z ~ AT (0,1) and let 
t > 0, then the following bound holds 


d w (N t ( h ), 


Z) < 


1 - 




+ 


/S 1 


|/i(0)!V t (dd) 


(10a) 


Therefore, if Hindoo II^IIl^s 1 ,^) = 1 andlim t -^oo IWIzcqs 1 ,/^) = ho ^ s that 

N t (,h ) Z . 


For d > 1, Zet .Z^ ~ A/"d (0, ±), where E is a d-dimensional positive-definite 
covariance matrix, h\,...,hd £ L 2 (S 1 ,/^) fl L 3 (S 1 ,/^); ZeZ 

G t := (fV t (/n),...,fV t (M) , 

associated to a covariance matrix Cd whose elements are given by 


Cd (Zi,Z2) — IE Nt (hi ,) Ay (Zq 2 ) — (h'ii, hi 2 )^2 'j > h,h — . 

Hence it holds that 


MGt,Z d ) < ||E- 1 || op ||E||| p ||E-C d || ffS .+ 


v/2tt 


|S —1 1| 2 IIEII 

I Nop 11 11 op 


d „ 

X £ / \h n (8)\\h l2 (6)\\h i3 (e)\p, t (d8) , ( 11 ) 

U,i a ,i 3 = 1 JS1 

where ||-|| and ||-|| ff s denote respectively operator and Hilbert-Schmidt norms. 


3 Rates of convergence to Gaussianity of Mexi¬ 
can needlet coefficients 

In this section will study the asymptotic behaviour of the means of the Mexican 
needlet coefficients, establishing two quantitative central limit theorems and 
their explicit rates of convergence. Under the assumptions stated in the previous 
section, let {A'i, i > 1} be a sequence of i.i.d. random variables with distribution 
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v, independent of the Poisson process N t (S 1 ). Consider the following kernel 

( 12 ) 

where £ L 1 (S 1 ,^) n L 2 (S 1 ,^) n L 3 (S 1 ,^). Let b jq , s := E [^ g;s (Xi)] 


hffi (0) ■■= r ^ A ° } , 0 £ S 1 , 

V Rt a jq-,s 


and 0 j g;s := E [^ g;g (Xi)]. Observe that, using (9a) and (|8j), 

0 < M 0 r]C 2 < M 0 ||^; S ||i 2 (sl) < Vj q;a < M oo ||^ig;*||r 2 rsi'l ^ M °°11 C 2 < 00 


■?«;»! 1 1,2 (§i) 


(13) 


Let us write 


S? ■■= Nt (h.™) = h{ £ W - R t J S1 h{ S W " W 


06supp(AT t ) 


or, in view of the Remark 2.2 


_ X)i=l ^ V(;g;s (Ni) Rtbjq-s 

Pja:s 


VRt< 


It is immediate to see that E 
define the d-dimensional vector 


Pi 


(Rt) 


jq;s 


■t&jq;s 


= 0, E 


(R t ) 

h'jq-^s 


Y = , «(«*) 

1 \ ■■■ll J jq d ',s I ! 


(14) 

= 1. Let us finally 

(15) 


while each element of its covariance matrix is given by 


T t,j,s ( qh,qi 2 ) ■= E 


~(R t ) ~(R t ) 


, *i, *2 = 1, d . 


(16) 


Let Z ~ AT (0, Id ), where Id is the d-dimensional identity matrix. Hence the 
following results holds. 


_ 

Theorem 3.1 Let /3 ■ by given by (14)■ there exist Cq such that 


d w [p { *? s ,Z) < Co (B~ j R t y 


Furthermore, if (B J R t ) 2 = o t (1), we have 'Pj q * a —>d Z. 
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Theorem 3.2 Let Y t by given by (15). there exist Ci,C 2 such that 
d 2 (Y t , Z d ) < C' 2 exp B 23 (x jqi - x jq2 ) 2 ) (l + B 2sj (. 


\%ini % i 


JQ 1 u '3Q2) 


+C' 2 d t (B-’Rt) 


1 

2 


Furthermore, if dt (B 3 Rt) 2 = Ot (1), ute have Y t ~^ d Z d . 

Observe that in both the cases we have the central limit theorem the Mexican 
needlet coefficients converge in distribution to the (univariate and multivariate) 
Gaussian distribution when B 3 = B 3 ' = o(R t ). 

Remark 3.1 Recall that the object B~ 3 R t can viewed as the effective sample 
size of the Mexican needlet (cfr. Remark f.3 in Indeed, B~ 3 can be 

thought as the ’effective’ scale of the wavelet Le. the dimension of the 

region Ej q . Hence, we obtain 

E [card {Xj : d (X,;, Xj q ) < B~ 3 }] ~ R t [ F (9) dO , 

Jd(Xi,x jq )<B~j 


where 

M 0 B~ j R t <Rt [ F (9) dd < M 00 B~ 3 R t . 

In what follows, before concluding this section, we will prove that the explicit 
bounds deduced in the Theorems 13. II and 13.21 can be also extended to the case of 
linear statistics based on vector of i.i.d. observations rather than on a Poisson 
measure, by paying the price of an additional factor proportional to d ( n ) n~i. 
Let us define the de-Poissonized vector 


Y' = - 

n /— 

Jn 


1 (Yhi-l V’j(n)qri;s C^i) Si=l ^j{n)q d ( n y,s i^-i) 




a o{n),q d(jl y,s 


and let us recall the following result from (TBj (Lemma 1.1). 

Proposition 3.3 Let R{n ) = n and consider Xi,i > 1, as random variables 
uniformly distributed over S 1 . Then, there exists a constant M d p > 0 such that 
for every n and every Lipschitz function f : R d ( n ) 1— > R, the following inequality 
holds 

\E[f(G' n )-f(G n )}\<M dP \\f\\ Lip ^$. 

n 4 

As consequence, there exists CdP > 0 such that following upper bound holds 
d 2 (G n ,Z d(n) ) < C dP ((*-*»>„)“* + d(n) . 
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4 Proofs 


In this Section we describe extensively the proofs of the Theorems |3.1| and |3.2 
and of some auxiliary results. 


4.1 Proofs of Theorems 13. II and 13.21 

Here we present the exhaustive proofs of the Theorems|3.1|3.2[ obtained by using 


the explicit kernel (121 in the Proposition 2.1 and exploiting the properties of 


the Mexican needlets such as 


Proof of the Theorem 3.1 Using (12) and (13) in (10a), we have 


1 - 


L (Rt) 

l jq;s 


LH SVt) 


= 0 


while 


/ 

J s 1 


MtW = 


| i’jg-,8 ( 0 )| 


Is 1 R? a 3 

J H u jq-s 


-Ht ( d0) 


< 


< 


< 


r: 


[ K'g;s( 6 ’)| 3 *' W 

J s 1 


(M 0 ?7c 2 ) 5 J si 

T || V’jqja (^)|li3(gi 


Rt ~ 2 M 0 

( M 0 r)c 2 ) 

{M 0 c 2 )i 


(RtB 


-J) 2 


where in the last equality we have used (|8|). ■ 

As far as the multivariate case is concerned, we obtain the following results. 
Proof of the Theorem \3.2\ By definition, we have 


lop = UdWop = 1 


while, following the Lemma 4.1 it holds that 

\\!d ~ lli/.s. 
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< 


E E 

~~(R t ) ~(R t )' 

6 B 

^ jqi'.sC jq 2 ;s 

91=92 

L J 


< d sup 


-Moo 


'jq i ^jq 2 ^ 


gi /g 2 = l,...,d Af 0 C 2 


On the other hand, the Lemma 


4.2 


^1 + (-B J (Xj qi x jq 2 )) 

states that there exists G > 0 such that 


E 

* 1 ,* 2,*3 = 1 ' 


(#) ^g i2 ; S ( 9 ) ( 9 ) Tt W < CR t dB ? . 


Hence, we have that 


E J sl | h ™,s (°) | | W | | h^. a (9) \n t (dO) 


* 1 ,* 2,*3 = 1 


< 


r: 


( M 0 r]c 2 Y 


d „ 

E / ^jq i3 -s(9) fl t (d9) 

,i 2 ,i 3 =l JS1 

dt (RtB-1)- 1 * CM «, 


(M 0 C 2 ) 5 

as claimed. ■ 

Remark 4.1 4s far as the dimension d is concerned, the bound proposed in the 


Theorem 3.2 holds both if d is fixed and if d = dt grows with t, see also fT&j . In 

- _1 

the former case, the bound obtained depends only on yB~RRf) 2 , while in the 

latter we obtain 


d w \^)J,Zj<Cd t (B-RR t ) 3 . 

To attain the convergence in distribution, it suffices that d t = o t \B~ H Rt) 


4.2 Auxiliary results 

This subsection includes the proofs of the auxiliary Lemmas, mainly related to 
asymptotic behaviour of the covariance matrix given in (161 


Lemma 4.1 For any j > 0 and 1 < qi ^ <72 < Qj> , there exists a constant 
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> 0 such that 


E 


X(«t) -rXRt) 

*jqi;aP jq 2 ;s 


< Q exp (-.B 2 -? (x j9l - a; jg2 ) 2 ) (l + (a; jgi - x jq2 )) 2s ^j . 


Proof. We have that 




' 


E 

@jqusfijq2-,s 


— 


1 


Rt cr jqi;s cr jq2;s J s 1 
1 


' ]qi,s^ jq 2 ,s 


is 1 


i’jqUsiO) ^jq 2 ',s W Mi W 

V> i9i;s 


< 






r?Mo^ v|TJ9i;s| ’ i r J®;»i/i a (si) • 

Analogously to [16], we split S 1 into two regions: 

51 = £ S : (9 — Xj qi ) > ( Xj qi — Xj q2 ) /2^ 

5 2 = £ S : (9 — Xj q2 ) > ( Xj qi — Xj q2 ) /2| . 

so that. 

(l^jqi'A > |V’jg 2 ;s|)i / 2(si) — f l^j?i;s WVW (fl)| dfl+ / (0)^ ;a (0)1 d0 

*/ Si */ 1 S 2 

From the localization property, it follows that 

/ |V’jg l ; S (0)^g 2 ; S (0)|d0 

JSi 
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< (c'J 2 V ex P ( -B 2j (Xjq 1 - z?g 2 ) 2 ) (l + B 23 (x jqi - x jq2 ) 2s ^ 


[ exp (-B* (0 - x jq2 ) 2 ) (l +(bV(0- 

J S i 




It is immediate to see 


J s exp (~B 2 i {6 - x jq2 ) 2 ) (l + (b 2 * (9 - s^J 2 )’) 2 B^dO 

r°° 9 

/ exp (—u 2 ) (l + u 2s )~ du 

Jo 


< 


< - + 21 S 


2T 2s 


so that 


[ tyjqus (°) i’jq 2 -,s (6)\ dd < (c") 2 ? 7 exp (~B 2j (, x jqi - x jq2 ) 2 ) (l + B 2sj (x jqi - x jq2 ] 
J S i 

This concludes the proof. ■ 

Remark 4.2 For j sufficiently large and for 6 Xj q , there exist r > s and 
C T > 0 such that 




<C T 1 


^ (9~ X Jq ) 


Lemma 4.2 Let f>j q - s (•) be given by (Q). For j > 1, there exists C > 0 such 


that 


(1 r. 

E / IVw (*)| |Vw Wl |Vw (0)1 de < Cdr-Bi. 

n, 1 


91 , 92,93 = 1 


Proof. Let 7s(9o,r) the arc centered on (?o of length r. Hence, for any 6 £ 

3 ( x jrn ,B- J ) 

d „ 

E / l^9i; S (0)N^92; S (0)||^ 3 ; S (0)| d 0 

91 , 92,93 = 1 S1 

<e/, , (e 1 ^ 9 ;-will 

m \ q=1 J 


2 s 


17 






Observe that 


E l^g;« 

9=1 


< Cs^Bi exp ( — 

g=l 


'BHO-Xj 9 ) 



1 + 


< c s r]i B? + Cs-qB* E exp(- 

q—Xjk$SS{xjm,B-3) 

+ 


Now, in view of Remark |4.2[ there exists r > s such that 



E exp ( - 

q—Xjq$:Ss(xj m ,B-i) 


'Bi(0- Xjq ) , , (B'lO 



1 + 


V 2 


[}+(B 2j {o-x jq f)y T . 

As in ng, by using triangle inequality, for Xj q ^ 3 (xj mi B J ) , 9 £ 3 (x^g, B 


- c r E B 2 

q~x jq £'Ss{x jm ,B-i) 


we have 


(•£79 %jrri) “b X jq) — %jm) > 


E si( 1 + (B«(0- a;i ,) 2 ))' 

9:=Xj,^St(xj m ,B-J) 


< E st(^'fe ro -^) 2 ) T 

q~Xj q gSS(x jm ,B-i) 

< y _^_/ 

q: =x, q mx jm ,B-^ P&to™ 1 *-’)) Mx dm ,B-i) 

< y _^_/ 

^ P ( X jm, B 3)) B~j) 

q:=Xj q £$S(xj m ,B J) J v JTn '’ J 

< c"Ri , 
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2 ^B i (jXjm x jq) ^ ^X 


2 r B5 


(b 2j (a:j m - a:) 2 ) dx 
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as in uni, Theorem 5.5. Therefore we have 


^|^ g;s (0)|<CVr?M 


9=1 


(17) 


which leads to 


E 


To complete the proof, we use to have 


^ (E j <10 < C-i/hi 


/ d \ 3 d d d 

/ El^WWl de = E / K'9i;s( 0 )| E l^'9; S ( 0 )l E 

•^ §1 \ 9=1 / 91 = 1 “'® 1 92 = 1 93 = 1 

d r. 

< C 2 T r)B>Y, / |VwWl^ 

<n=i Jsl 


< (%T}BS'52\\il, jqi . a \\ 


91=1 

< C 2 dr)^Bi . 


5 An application: nonparametric density esti¬ 
mation 

In this section, we will present a practical application in the framework of non¬ 
parametric thresholding density estimation. The thresholding techniques, intro¬ 
duced in the literature by m , have become a successful tool in statistics, used 
in many research fields, cfr. the textbooks (25Jj35]. The asymptotic result here 
established are related to random vectors on the unit circle assuming the form 
(in the “de-Poissonized” case) 

■ 

Consider now a set of random circular observations {Xi £§':! = !, ...,n} 
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with common distribution v (6) = F (6) dd. Let us introduce the threshold 
Cj q ( T n) '■= L| | >KT j , where n is a real-valued positive constant to be 
chosen to set the size of the threshold (cfr. [5j ; l8j): the thresholding density 
estimator is given by 

J n Qi 

f(»)=EE Cfc (r„) I3 jq ^ jq ;s (0) , e G s 1 . ( 18 ) 

i=Jo 9=1 

where r n = as usual in the literature (see for instance u m na na). 

Further details on this topic can be found in |18j . Finite-sample approximations 
on the distribution of the coefficients /3j q . s can be useful to fix an optimal value 
of the thresholding constant k, by using a plug-in procedure built as follows 

1. Fixed a resolution level j *, the finite-sample approximations on the dis¬ 
tributions of the coefficients /3 •„ can be establish explicitly their corre¬ 
sponding expected values and variances. 

2. Using those informations, an optimal threshold nr n can be built. 

3. Study the nonparametric density estimator F with the optimal threshold. 

More in details, r„ depends on R t: while k can be built on the value of 
sample expected values and variances. In particular, observe that for the cut-off 
frequency J n = Jn t , usually chosen such that B jR t = yjRt/ log (Rt), we have 
that 

6 Numerical results 

In this section we will present some numerical evidences obtained by simula¬ 
tions on CRAN R-code. Observe that these results, obtained in a finite sample 
situation, can be considered just as a qualitative check of the main theoret¬ 
ical achievements here proposed. We develop a procedure to build Mexican 
coefficients, in the univariate case, according to the following guidelines: 

1. the distribution on S 1 is uniform, such that for any j, q 

0 , 

r ( 2 *+5) , 

2 (2s-t-i) 


bjq;s 
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j=10,t=50 j=20,t=50 j=30,t=50 



Figure 3: histograms of estimates of Mexican needlet coefficients corresponding 
to various choices of j and t. 

2. the intensity of the Poisson process is given by Rt = R ■ t, with R = 10 
and t = 50,100,150. 

3. the needlets taken into account corresponds to the resolution levels: j = 
10, 20, 30, while we fixed B = 1.3, Xj q = ir and s = 3. 

In Figure [3j each histogram describes the normalized Mexican coefficients 
built after the iteration of N max = 500 simulations, combining j = 10, 20, 30 and 
t = 50,100,150. Observe that they attain fastly the Gaussianity, as confirmed 
in Table 1 by the results of the Shapiro-Wilk test. Indeed, the test statistic W 
is closer to 1 for growing t and it is slowing decreasing as j increases. Further¬ 
more, p-values increase strongly with t. As a counterexample, in Figure [4] we 
describe the distribution corresponding to the case t = 5, j = 40, on which the 
Gaussianity does not seem to be attained, as confirmed by the Shapiro-Wilk 
which gives as result W = 0.9719, p-value = 3.207 • 10 -8 . 

Acknowledgements - The author whishes to thank D. Marinucci for the useful 
discussions and E. Calfa for the accurate reading. 
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